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Abstract. Constraint logic grammars provide a powerful formalism for 
expressing complex logical descriptions of natural language phenomena 
in exact terms. Describing some of these phenomena may, however, re- 
quire some form of graded distinctions which are not provided by such 
grammars. Recent approaches to weighted constraint logic grammars at- 
tempt to address this issue by adding numerical calculation schemata to 
the deduction scheme of the underlying CLP framework. 
Currently, these extralogical extensions are not related to the model- 
theoretic counterpart of the operational semantics of CLP, i.e., they do 
not come with a formal semantics at all. 

The aim of this paper is to present a clear formal semantics for weighted 
constraint logic grammars, which abstracts away from specific interpreta- 
tions of weights, but nevertheless gives insights into the parsing problem 
for such weighted grammars. Building on the formalization of constraint 
logic grammars in the CLP scheme of [11], this formal semantics will be 
given by a quantitative version of CLP. Such a quantitative CLP scheme 
can also be valuable for CLP tasks independent of grammars. 



1 Introduction 

Constraint logic grammars (CLGs) provide a powerful formalism for complex 
logical description and efficient processing of natural language phenomena. Lin- 
guistic description and computational practice may, however, often require some 
form of graded distinctions which are not provided by such grammars. 

One such issue is the task of ambiguity resolution. This problem can be illus- 
trated for formal grammars describing a nontrivial domain of natural language 
as follows: For such grammars every input of reasonable length may receive a 
large number of different analyses, many of which are not in accord with human 
perceptions. Clearly there is a need to distinguish more plausible analyses of an 
input from less plausible or even totally spurious ones. 

* I am greatly indebted to Steven Abney, Thilo Gotz and Paul King for their valuable 
comments on this paper. Furthermore, I would like to thank Graham Katz, Frank 
Morawietz and two anonymous LACL referees for their helpful suggestions. 



This problem has successfully been addressed by the use of weighted gram- 
mars for disambiguation in regular and context-free grammars. Weighted gram- 
mars assign numerical values, or weights, to the structure-building components 
of the grammars and calculate the weight of an analysis from the weights of the 
structural features that make it up. The correct analysis is chosen from among 
the in-principle possible analyses by assuming the analysis with the greatest 
weight to be the correct analysis. This approach also allows parsing to be speeded 
up by pruning low-weighted subanalyses. 

The idea of weighted grammars recently has been transferred to highly ex- 
pressive weighted CLGs by [8, 9] and [7]. The approaches of Erbach and Eisele 
are based on the feature-based constraint formalism CUF ([5, 4]), which can 
be seen as an instance of the constraint logic programming (CLP) scheme of 
[11]. These approaches extend the underlying formalism by assigning weights 
to program clauses, but differ with respect to an interpretation of weights in a 
preference-based versus probabilistic framework. Erbach calculates a preference 
value of analyses from the preference values of the clauses used in the analyses, 
whereas Eisele assigns application probabilities to clauses from which a proba- 
bility distribution over analyses is calculated. 

There is an obvious problem with these approaches, however. Even if the 
formal foundation of the underlying framework is clear enough, there is no well- 
defined semantics for the weighted extensions. This means that these extralogical 
extensions of the deduction scheme of the underlying constraint logic program 
are not related to the model-theoretic counterpart of this operational seman- 
tics, i.e., they do not come with a formal semantics at all. This is clearly an 
undesirable state of affairs. Rather, in the same way as CLGs allow for a clear 
model-theoretic characterization of linguistic objects coupled with the opera- 
tional parsing system, one would prefer to base a quantitative deduction system 
on a clear quantitative model-theory in a sound and complete way. 

The aim of this paper is to present a clear formal semantics for weighted 
CLGs, which abstracts away from specific interpretations of weights, but gives 
insight into the parsing problem for weighted CLGs. Building on the formal- 
ization of CLGs in the CLP scheme of [11], this formal semantics will be given 
by a quantitative version of CLP. Such a quantitative CLP scheme can also be 
valuable for CLP tasks independent of grammars. 

Previous work on related topics has been confined to quantitative extensions 
of conventional logic programming. A quantitative deduction scheme based on a 
fixpoint semantics for sets of numerically annotated conventional definite clauses 
was first presented by van Emden in [26] . In this approach numerical weights are 
associated with definite clauses as a whole. The semantics of such quantitative 
rule sets is based upon concepts of fuzzy set algebra and crucially deals with the 
truth-functional "propagation" of weights across definite clauses. Van Emden's 
approach initialized research into a now extensively explored area of quantitative 
logic programming. For example, annotated logic programming as introduced by 
[25] extends the expressive power of quantitative rule sets by allowing variables 
and evaluable function terms as annotations. Such annotations can be attached 



to components of the language formula and come with more complex mappings 
as a foundation for a multivalued logical semantics. Such extended theories are 
interpreted in frameworks of lattice-based logics for generalized annotated logic 
programming ([15]), possibilistic logic for possibilistic logic programming ([6]) 
or logics of subjective probability for probabilistic logic programming ([20, 21]) 
and probabilistic deductive databases ([17, 18]). 

Aiming at a formal foundation of weighted CLGs in a framework of quan- 
titative CLP, we can start from the ideas developed in the simple and elegant 
framework of [26], but transfer them to the general CLP scheme of [11]. This 
means that the form of weighted CLGs under consideration allows us to restrict 
our attention to numerical weights associated with CLP clauses as a whole. Fur- 
thermore, the simple concepts of fuzzy set algebra can also provide a basis for an 
intuitive formal semantics for quantitative CLP. Such a formal semantics will be 
sufficiently general in that it is itself not restricted by a specific interpretation of 
weights. Further extensions should be straightforward, but have to be deferred 
to future work. Our scheme will straightforwardly transfer the nice properties of 
the CLP scheme of [11] into a quantitative version of CLP. 

2 Constraint Logic Programming and Constraint Logic 
Grammars 

Before discussing the details of our quantitative extension of CLP, some words 
on the underlying CLP scheme and grammars formulated by these means are 
necessary. In the following we will rely on the CLP scheme of [11], which gener- 
alizes conventional logic programming (see [19]) and also the CLP scheme of [12] 
to a scheme of definite clause specifications over arbitrary constraint languages. 
A very general characterization of the concept of constraint language can be 
given as follows. 

Definition 1 £ . A constraint language £ consists of 

1. an £ -signature, specifying the non- logical elements of the alphabet of the 
language, 

2. a decidable infinite set VAR whose elements are called variables, 

3. a decidable set CON of £ -constraints which are pieces of syntax with un- 
known internal structure, 

4. a computable function V assigning to every constraint <\> G CON a finite set 
V(</>) of variables, the variables constrained by <fi, 

5. a nonempty set of £ -interpretations INT, where each £ -interpretation 1 e 
INT is defined w.r.t. a nonempty set V, the domain of I, and a set ASS of 
variable assignments VAR — > £>, 

6. a function mapping every constraint <p S CON to a set [0] x of variable 
assignments, the solutions of <j> in X . 

7. Furthermore, a constraint 4> constrains only the variables in V(</>), i.e., if 
a E [</>] x and /3 is a variable assignment that agrees with a on V '(</)), then 



To obtain constraint logic programs, a given constraint language £ has to be 
extended to a constraint language TZ(£) providing for the necessary relational 
atoms and propositional connectives. 

Definition 2 1Z(£) . A constraint language 1Z(£) extending a constraint lan- 
guage £ is defined as follows: 

1. The signature of TZ(£) is an extension of the signature of £ with a decidablc 
set 1Z of relation symbols and an arity function Ar : 1Z — > IN. 

2. The variables of 1Z(£) are the variables of £ . 

3. The set of 7?.(£)-constraints is the smallest set s.t. 

— cf> is an 7?.(£)-constraint if <j> is an /^-constraint, 

— r(x) is an 7\L(£)-constraint, called an atom, if r E 1Z is a relation symbol 
with arity n and x is an n-tuple of pairwise distinct variables, 

— 0, F & G, F — > G are 7?.(£)-constraints, if F and G are 7\L(£)-constraints, 

— <p & B\ & . . . & B n — > A is an 7?.(£)-constraint, called a definite clause, 
if A, B\, . . . , B n are atoms and <p is an /^-constraint. We may write a 
definite clause also as A & _Bi & . . . & B n . 

4. The variables constrained by an 1Z(£) -constraint are defined as follows: If 
4> is an /^-constraint, then V(</>) is defined as in £ ; V(r(xi, . . . , x n )) := 
{x u x n }- V(0) := 0; V(F & G) := V(F) U V(G); V(F - G) := V(F) U 
V(G). 

5. For each £ -interpretation X , an 7?.(£)-interpretation A is an extension of an 
/^-interpretation X with relations r A on the domain £> of A with appropriate 
arity for every r E 1Z and the domain of A is the domain of X. 

6. For each 1Z(£) -interpretation A , for each £ -interpretation X , J-]- 4 is a 
function mapping every 7£(£) -constraint to a set of variable assignments 
s.t. 

— l<j)} A = if <p is an /^-constraint, 

— [r(x)]- 4 = {a G ASS| a(x) £ r- 4 }, 

— iq A = ASS, 

— [F & G]- 4 = {Fj A n [G]^, 

— [F -> G]- 4 = (ASS \ [F]- 4 ) U [G]- 4 . 

A constraint logic program then is defined as a definite clause specification 
over a constraint language. 

Definition 3 definite clause specification. A definite clause specification V 
over a constraint language £ is a set of definite clauses from a constraint language 
1Z{£) extending £ . 

Relying on terminology well-known for conventional logic programming, Hoh- 
feld and Smolka's generalization of the key result of conventional logic program- 
ming can be stated as follows: 2 First, for every definite clause specification V 

2 Further conditions for this generalization to hold are decidability of the satisfiability 
problem, closure under variable renaming and closure under intersection for the 
constraint languages under consideration. 



in the extension of an arbitrary constraint language C , every interpretation of 
C can be extended to a minimal model of V. Second, the SLD-resolution method 
for conventional logic programming can be generalized to a sound and complete 
operational semantics for definite clause specifications not restricted to Horn 
theories. In contrast to [12], in this scheme constraint languages are not required 
to be sublanguages of first order predicate logic and do not have to be inter- 
preted in a single fixed domain. This makes this scheme usable for a wider range 
of applications. Instead, a constraint is satisfiable if there is at least one interpre- 
tation in which it has a solution. Moreover, such interpretations do not have to 
be solution compact. This was necessary in [12] to provide a sound and complete 
treatment of negation as failure, which is not addressed in [11]. 

The term constraint logic grammars expresses the connection between CLP 
and constraint-based grammars. Constraint-based grammars allow for a clear 
model-theoretic characterization of linguistic objects by stating grammars as 
sets of axioms of suitable logical languages. However, such approaches do not 
necessarily provide an operational interpretation of their purely declarative spec- 
ifications. This may lead to problems with an operational treatment of dcclara- 
tively well-defined problems such as parsing. CLP provides one possible approach 
to an operational treatment of various such declarative frameworks by an em- 
bedding of arbitrary logical languages into constraint logic programs. CLGs thus 
are grammars formulated by means of a suitable logical language which can be 
used as a constraint language in the sense of [ll]. 3 

For example, for feature based grammars such as HPSG ([23]), a quite direct 
embedding of a logical language close to that of [24] into the CLP scheme of [11] is 
done in the formalism CUF ([5, 4]). This approach directly offers the operational 
properties of the CLP scheme by simply redefining grammars as constraint logic 
programs, but is questionable in losing the connection to the model-theoretic 
specifications of the underlying feature-based grammars. A different approach 
is given by [10] where a compilation of a logical language close to that of [16] 
into constraint logic programs is defined. This translation procedure preserves 
important model-theoretic properties by generating a constraint logic program 
V(G) from a feature-based grammar Q in an explicit way. 

The parsing/generation problem for CLGs then is as follows. Given a program 
V (encoding a grammar) and a definite goal G (encoding the string/logical form 
we want to parse/generate from), we ask if we can infer an answer (p of G (which 
is a satisfiable £ -constraint encoding an analysis) proving the implication <p — > G 
to be a logical consequence of V. 

3 Clearly, a direct definition of an operational semantics for specific constraint-based 
grammars is possible and may even better suit the particular frameworks. However, 
such approaches have to rely directly on the syntactic properties of the logical lan- 
guages in question. Under the CLP approach, arbitrary constraint-based grammars 
can receive a unique operational semantics by an embedding into definite clause spec- 
ifications. The main advantage of this approach is the possibility to put constraint- 
based grammar processing into the well-understood paradigm of logic programming. 
This allows the resulting programs to run on existing architectures and to use well- 
known optimization techniques worked out in this area. 



3 Quantitative Constraint Logic Programming 



3.1 Syntax and Declarative Semantics of Quantitative Definite 
Clause Specifications 

Building upon the definitions in [11], we can define the syntax of a quantitative 
definite clause specification Vf very quickly. A definite clause specification V 
in 1Z(£) can be extended to a quantitative definite clause specification Vf in 
TZ(jC) simply by adding numerical factors to program clauses. 

The following definitions are made with respect to implicit constraint lan- 
guages £ and 1Z(£) . 

Definition 4 Vf • A quantitative definite clause specification Vf in 1Z(£) is a 
finite set of quantitative formulae, called quantitative definite clauses, of the 
form: <f> & B\ & . . . & B n / — > A, where A, B\, . . . , B n arc TZ(£) -atoms, <f) is 
an £ -constraint, n > 0, / e (0, 1]. We may write a quantitative formula also as 
A<- f (j> & Bi & ... & B n . 

Such factors should be thought of as abstract weights which receive a concrete 
interpretation in specific instantiations of Vf by weighted CLGs. 

In the following the notation 1Z(£) will be used more generally to denote 
relationally extended constraint languages which possibly include quantitative 
formulae of the above form. 

To obtain a formal semantics for Vf , first we have to introduce an ap- 
propriate quantitative measure into the set-theoretic specification of 1Z(£) - 
interpretations. One possibility to obtain quantitative TZ(£) -interpretations is 
to base the set algebra of TZ(£) -interpretations on the simple and well-defined 
concepts of fuzzy set algebra (see [27]). 

Relying on Hohfeld and Smolka's specification of base equivalent 1Z(£) - 
interpretations, i.e., 1Z(£) -interpretations extending the same £ -interpretation, 
in terms of the denotations of the relation symbols in these interpretations, we 
can "fuzzify" such interpretations by regarding the denotations of their relation 
symbols as fuzzy subsets of the set of tuples in the common domain. 

Given constraint languages £ and 7Z(£) , we interpret each n-ary relation 
symbol r G 1Z as a fuzzy subset of T> n , for each 1Z(£) -interpretation A with do- 
main V. That is, we identify the denotation of r under A with a total function: 
v(-;r A ) : V n -> [0,1], which can be thought of as an abstract membership func- 
tion. Classical set membership is coded in this context by membership functions 
taking only and 1 as values. 

Next, we have to give a model-theoretic characterization of quantitative def- 
inite clauses. Clearly, any monotonous mapping could be used for the model- 
theoretic specification of the interaction of weights in quantitative definite clauses 
and accordingly for the calculation of weights in the proof-theory of quantitative 
CLP. For concreteness, we will instantiate such a mapping to the specific case 
of Definition 5 resembling [26] 's mode of rule application. This will allow us to 
state the proof-theory of quantitative CLP in terms of min/max trees which in 



turn enables strategies such as alpha/beta pruning to be used for efficient search- 
ing. However, this choice is not crucial for the substantial claims of this paper 
and generalizations of this particular combination mode to specific applications 
should be straightforward, but are beyond the scope of this paper. 

The following definition of model corresponds to the definition of model in 
classical logic when considering only clauses with f = 1 and mappings V n — > 
{0,1}. 

Definition 5 model. An 1Z(£) -interpretation A extending some £ -interpre- 
tation X is a model of a quantitative definite clause specification Pp iff for each 
a G ASS , for each quantitative formula r(x) <— y cj) & <?i(xi) & ... & qk{*k) in 
P F ■ If a e [0] x , then fj,(a(x); r A ) > f x min{fj,(a(x. j ); qf)\ 1 < j < k}. 

Note that the notation of an TZ(£) -interpretation A will be used more gen- 
erally to include interpretations of quantitative formulae. TZ(£) -solutions of a 
quantitative formula are defined as |r(x) qi(xi) & ... & qk(xk)J A = 

{a e ASS | If a e [4>f ', then fi(a{x);r A ) > / x mm{/i(a(xj); qf)\ l<j< k}}. 

The concept of logical consequence is defined as usual. 

Definition 6 logical consequence. A quantitative formula r(x) <— / <fi is a 
logical consequence of a quantitative definite clause specification Pp iff for each 
1Z(£) -interpretation A , A is a model of implies that A is a model of {r(x) <— ^ 
^}. 

Furthermore, we have that r(x) <— / is a logical consequence of "Pf implies that 
r(x) <— is a logical consequence of Pf for every /' < /. 

A goal G is defined to be a (possibly empty) conjunction of TZ(£) -atoms 
and £ -constraints. We can, without loss of generality, restrict goals to be of the 
form r(x) & 4>, i.e., a (possibly empty) conjunction of a single relational atom 
r(x) and an £ -constraint <j). This is possible as for each goal G — r"i(xi) & . . . 
& rfc(xfe) & containing more than one relational atom, we can complete the 
program with a new clause C = r(xi, . . . ,Xfe) «— i ri(xi) & ... & ^(x^) & 
with G as antecedent and a new predicate with all variables in G as arguments 
as consequent. Submitting the new predicate r(xi, . . . ,x^) as query yields the 
same results as would be obtained when querying with the compound goal G. 

Given some Pp and some goal G, a "Pp -answer ip of G is defined to be 
a satisfiable £ -constraint ip s.t. ip f — ► G is a logical consequence of Pf . A 
quantitative formula ip / — > r(x) & is defined to be a logical consequence of 
Pf iff every model of Pp is a model of {(p / — ► r(x) & 0}. An 7£(£) -interpretation 
^4 is defined to be a a model of / — > r(x) & 0} iff [t/j] -4 C and ^4 is a 

model of {r(x) <— / i/j}. 

Aiming to generalize the key result in the declarative semantics of CLP — 
the minimal model semantics of definite clause specifications over arbitrary con- 
straint languages — to our quantitative CLP scheme, first we have to associate 
a complete lattice of interpretations with quantitative definite clause specifica- 
tions. 



Adopting Zadch's definitions for set operations, we can define a partial order- 
ing on the set of base equivalent 7Z(£) -interpretations. This is done by defining 
set operations on these interpretations with reference to set operations on the 
denotations of relation symbols in these interpretations. We get for all base 
equivalent 1Z(£) -interpretations A, A': 

— A C A ' iff for each n-ary relation symbol r £ 7Z , for each a £ ASS , for 
each x £ VAR": fj,(a(x);r A ) < p(a{x.);r A ' ), 

— A = [J X iff for each n-ary relation symbol r £ 1Z , for each a £ ASS , for 
each x £ VAR": p(a(x); r A ) = sup{p{a(x); r A ')\ A ' £ X}, 

-A = f] X iff for each n-ary relation symbol r £ 7Z , for each a £ ASS , for 
each x £ VAR": fi(a(x);r A ) = inf{p(a(x);r A ' )\ A' e X}, 

— sup = 0, inf 0=1. 

Clearly, the set of all base equivalent 1Z(£) -interpretations is a complete lattice 
under the partial ordering of set inclusion. 

Next we have to apply the syntactic notions of renaming and variant to 
the quantitative case. A renaming is a bijection VAR — > VAR which is the 
identity except for finitely many exceptions and VAR is a decidable infinite set 
of variables. 

A quantitative formula k' is a p-variant of a quantitative formula k under a 
renaming p iff V(k') = p(V(«)), where V is a computable function assigning to 
every quantitative formula k the set V(/c) of variables occurring in k; k' = up, 
i.e., k' is the quantitative formula obtained from k by simultaneously replacing 
each occurrence of a variable X in k by p{X) for all variables in V(k); and 
\k\ a = \n'\ A p := {a o p\ a £ [k']" 4 } for each interpretation A . 
A quantitative formula n' is a variant of a quantitative formula k if there exists 
a renaming p s.t. k' is a p-variant of k. 

Using these definitions, we can state the central equations which link the 
declarative and procedural semantics of Vf ■ 

Definition 7. Let Vf be a quantitative definite clause specification in 1Z(C) , 
I be an £ -interpretation. Then the countably infinite sequence (Ao , A\ , Ai , ■ ■ .) 
of 1Z(£) -interpretations extending X is a Vf -chain iff for each n-ary relation 
symbol r £ 1Z , for each a £ ASS , for each x £ VAR": 

p(a(x):r Ao ) := 0, 

/i(a(x); r Ai+1 ) := max{ f x min{fi(a(xj); q Ai )\ 1 < j < n} | there is a variant 
r(x) <— / 4> & (Ji(xi) & ... & g„(x„) of a clause in and a £ I^]" 4 ' }- 

Before stating the central theorem concerning the declarative semantics of 
quantitative definite clause specifications, we have to prove the following useful 
lemma (cf. [26], Lemmata 2.10', 2.11'): 

Lemma8. For each Vf , for each Vf -chain (Ao, Ai, A2, ■ ■ ■), for each n-ary 
relation symbol r £ 1Z , for each a £ ASS , for each x £ VAR", there exists some 

n £ IN s.t. p(a{x);r^'>° A ") = p(a(n);r A "). 



Proof. We have to show that the supremum v = sup{/i(a(x); r Ai )\ i > 0} can 
be attained for some n G IN. 

v = 0: For v = 0, we have n = 0. 

w > 0: For u > 0, we have to show for any real e, < e < v: {^(a(x); 7-^)1 z > 
and fj,(a(x)]r Ai ) > e} is finite. 

Let F be the finite set of real numbers of factors of clauses in Vf , m be 
the greatest element in F s.t. m < 1 and let g be the smallest integer s.t. 
m q < e. 

Then, since each real number /i(a(x); r- 4 *) is a product of a sequence of 
elements of F, the number of different products > e is not greater than \F\ q 
(in combinatorics' talk, the permutation of \F\ different things taken q at a 
time with repetitions) and thus finite. 

Hence, the supremum is the maximum attained for some n G IN. □ 

Now we can obtain minimal model properties for quantitative definite clause 
specifications similar to those for the non-quantitative case of [11]. Theorem 9 
states that we can construct a minimal model A of Vf for each quantitative def- 
inite clause specification V-p in the extension of an arbitrary constraint language 
C and for each C -interpretation. This means that — due to the definiteness of 
Vf — we can restrict our attention to a minimal model semantics of Vf ■ 

Theorem 9 definiteness. For each quantitative definite clause specification Vf in 
TZ(C) , for each £ -interpretation! , for each Vf -chain (Ao , A\ , A2 , ■ • •} ofTZ(C) - 
interpretations extending some £ -interpretation X : 

(i) A0QA1Q..., 

(ii) the union A := Ui>o * s a m °del of Vf extending 2 , 

(iii) A is the minimal model of Vf extending X . 

Proof, (i) We have to show that Ai C Ai+\. We prove by induction on i showing 
for each constraint language £ , for each quantitative definite clause specification 
Vf in TZ(£) , for each £ -interpretation X , for each Vf -chain (Ao, Ai, A2, ■ ■ ■) of 
Ti,{£) -interpretations extending some £ -interpretation X , for each n-ary rela- 
tion symbol r 6 R, for each a € ASS , for each x G VAR", for each i £ IN: 
jti(a(x); r Ai ) < /j,(a(x); r Ai+1 ). 

Base: /i(a(x); r Aa ) = < n(a(x); r Al ). 

Hypothesis: Suppose /u(a(x); r An - x ) < /u(a(x); r An ). 

Step: /^(a(x); r An ) = v > 

===> there exists a variant r(x) <— f <j> & qi (xi) & ... & qk(*-k) of a clause 
in V F s.t. v = f x mm^fafxi)^/"- 1 ), . . . , / u(a(x fc ); i^- 4 ™- 1 )} and a G 
M" 4 "- 1 , by Definition 7 

=> ^("(xi);?!- 4 ") > ^("(xi);?!- 4 "- 1 ), 

• • • ^("(xa;);^- 4 ") > ^(^(xfe);^- 4 "- 1 ) and a G M" 4 ", by the hypothe- 
sis 



=> fi(a(x);r An+1 ) > v, by definition of /j,(a(x);r Ai+1 ) 
=> yu(a(x); r An ) < fi(a(x); r An+1 ). 

For v = follows immediately /j,(a(x); r A ™ ) < ^(a(x); r An+1 ). 
Claim (i) follows by arithmetic induction. 

(ii) We have to show that A := U«>o * s a m °del of "Pp extending X . We 
prove that for each clause r(x) <—/(/>"& <7i(xi) & ... & qk{xk) in "Pf , for each 
a £ ASS : If a e J^]" 4 , then /z(a(x); r- 4 ) > / x min{^(a(x 3 ); g^" 4 )! 1 < j < k}. 

Note that since every Ai is an 7?.(£) -interpretation extending T , A is an 7\L(£) - 
interpretation extending X . 

Now let r(x) <— ^ & & ... & qk(x-k) be a clause in "Pf s.t. for some 

a e ASS : a G J^]- 4 and /u(a(xi); f^- 4 ) = mm{/j(o(xj); ^j" 4 )! 1 < j < = v. 

Then there exists some n £ IN s.t. w = fi(a(xi); qi An ) = min{^(a(xj); qj An )\ 
1 < J < k}, by Lemma 8 and since for all j s.t. 1 < j < k : [i(a(xj): qj A ) = 
sup{^(a(x :) );^- 4 ')| i > 0} 

=> ii{a(x);r An+1 ) > f x v, by Definition 7 

/i(a(x);r- 4 ) > ^(a(x); r' 4 "+ 1 ), since /^i(a(x);r- 4 ) = 
swp{/^,(a(x); r Ai ) \ i > 0} 

==> /i(a(x); r- 4 ) > / x mzn{/i(a(xj); f^" 4 )! 1 < j < A;}. 
This completes the proof for claim (ii) . 

(iii) We have to show that A is the minimal model of Pf extending 1 . We prove 
for every base equivalent model B of Pf '■ Ai C £>, which gives A C i3, by induc- 
tion on i showing for each constraint language C , for each quantitative definite 
clause specification Vf in Tl{C) , for each £ -interpretation X , for each Vf -chain 
(•4o,Al,.<42, • • •} °f -interpretations extending some £ -interpretation Z, 
for each n-ary relation symbol r £ TZ , for each a £ ASS , for each x £ VAR™, for 
each i £ IN: ^(a(x); r Ai ) < /j,(a(x);r B ). 

Base: ^(x);?- 40 ) = < fi(a{x);r B ). 
Hypothesis: Suppose /x(a(x); r An ~ x ) < ^i(a(x); r B ). 
Step: ^(a(x);r- 4 ") = v > 

==>■ there exists a variant r(x) <—f<fr&c gi(xi) & ... & qk{xk) of a clause in 
Pf s.t. v = f x min{fi(a(xi); qi^- 1 ), . . . , /j,(a(x k ); q^"- 1 )} 
and a £ [0]" 4 " -1 , by Definition 7 

> M(a(xi);<?i- 4 — 1 ), 
. . . ,{i(a(x k );q k B ) > /x(a(x fe ); qfc- 4 "- 1 ) and a e by the hypothesis 

t>, since i3 is a model of Vf 
=> /i(a(x); r- 4 ") < /i(a(x); r B ). 



For v = follows immediately /j,(a(x);r An ) < fi(a(x);r B ). 
Claim (iii) follows by arithmetic induction. □ 

Proposition 10 allows us to link the declarative description of the desired 
output from Vf and a goal, i.e., a Vf -answer, to the minimal model semantics 
of Vf ■ This is done by connecting the concept of logical consequence with the 
concept of minimal model. 

Proposition 10. Let Vf be a quantitative definite clause specification in 1Z(£) , 
if be an C -constraint and G be a goal. Then ip „— > G is a logical consequence of 
Vf iff every minimal model A of Vf is a model of {p „ — > G}. 

Proof, if: For each minimal model A of Vf ■ A is a model of {p v — ► G} 

=^> for every base equivalent model B of Vf ■ B is a model of {ip v ^ G}, 
since A C B by Theorem 9, (iii) 

=/• p v ^> G is & logical consequence of Vf ■ 

only if: p v — > G is a logical consequence of Vf 

=^> every model of Vf is a model of {p v — > G}, by Definition 6 

=^> A is a model of {p v — > G}. □ 

The following toy example will illustrate the basic concepts of the declarative 
semantics of quantitative definite clause specifications. 

Example 1. Consider a simple program Vf consisting of clauses 1, 2 and 3. Let 
for the sake of the example be \X — cf) & X — ip\ T — for each C -interpretation 
X. 

1 p(X) <-. 7 X = </>. 

2 p(X) <-. 8 X = 4>. 
3p(X)^. 9 X = ^. 

A Vf -chain for predicate p and an object a(X) allowed by the C -constraint 
X = <fi is constructed as follows. 

li((a(X));p A °) = 0, 

fi((a(X)} ;p Al ) = max{.7 x min 0, .5 x min 0} = .7, 
H((a(X)) ;p A2 ) = max{.7 x min 0, .5 x min 0} = .7, 



The membership value of this object in the denotation of p under the minimal 
model A of Vf is attained in step 1 and calculated as follows. 

li((a(X)) ;fM>o Ai ) = sup {o, .7, .7, ...} = .! . 

Clearly, ^4 is a model of clauses 1 and 2. A similar calculation can be done 
for clause 3. 



3.2 Operational Semantics of Quantitative Definite Clause 
Specifications 

The proof procedure for quantitative CLP is a search of a tree, corresponding to 
the search of an SLD-and/or tree in conventional logic programming and CLP. 
Such a tree is defined with respect to the inference rules — ^ and — of [11] and 
a specific calculation of node values. The structure of such a tree exactly reflects 
the construction of a minimal model and thus may be defined as a min/max tree. 
In the following we will assume implicit constraint languages C and 71(C) and a 
given quantitative definite clause specification Vf in 71(C) . Furthermore, V will 
denote the finite set of variables in the query and the V-solutions of a constraint 
<f> in an interpretation Xare defined as [</>]y := {ct\v\ a 6 \<f)\ 1 } and a\v is the 
restriction of a to V. 

The first inference rule is given by a binary relation — ► , called goal reduc- 
tion, on the set of goals. 

A k G -^-> F k G A ^ F is & variant of a clause in V 
s.t. (VUV(G)) nV(F) C M(A). 

A second rule takes care of constraint solving for the /^-constraints appearing 
in subsequent goals. The rule takes the conjunction of the /^-constraints from 
the reduced goal and the applied clause and gives, via the black box of a suitable 
C- constraint solver, a satisfiable /^-constraint in solved form if the conjunction 
of /3-constraints is satisfiable. The constraint solving rule can then be defined as 
a total function — ► on the set of goals. 

4> k $ k G -±> 0" k G if {4> k 4>%UV{G) = W%uv(G) 

for each C- interpretation X and for all C -constraints <fi, </>' and <fi" . 

Definition 11 min/max tree. A min/max tree determined by a query G\ and 
a quantitative definite clause specification V? has to satisfy the following condi- 
tions: 

1. Each max-node is labeled by a goal. The value of each nonterminal max- node 
is the maximum of the values of its descendants. 

2. Each min-node is labeled by a clause from Vf and a goal. The value of each 
nonterminal min-node is / x m, where / is the factor of the clause and m is 
the minimum of the values of its descendants. 

3. The descendants of every max-node are all min-nodes s.t. for every clause C 
with — r —* -resolvent G' obtained by C from goal G in a max-node, there is a 
min-node descendant labeled by C and G' . 

4. The descendants of every min-node are all max-nodes s.t. for every 71(C) - 
atom r(x) in goal Gk(j)k(j)' in a min-node with — -resolvent Gk(f>", there 
is a max-node descendant labeled by r(x) k cf>" . 

5. The root node is a max-node labeled by G\. 

6. A success node is a terminal max-node labeled by a satisfiable 
£ -constraint. The value of a success node is 1. 



7. A failure node is a terminal max-node which is not a success node. The value 
of a failure node is 0. 

Definition 12 proof tree. A proof tree for goal G\ from Vf is a subtree of a 
min/max supertree determined by G\ and Vf and is defined as follows: 

1. The root node of the proof tree is the root node of the supertree. 

2. A max-node of the proof tree is a max-node of the supertree and takes one 
of the descendants of the supertree max-node as its descendant. 

3. A min-node of the proof tree is a min-node of the supertree and takes all of 
the descendants of the supertree max-node as its descendants. 

4. All terminal nodes in the proof tree are success nodes 0, </>',... 

s.t. (j> & (j) 1 & ... — ► if and ip is a satisfiable £ -constraint, called answer 
constraint. 

5. Values are assigned to proof tree nodes in the same way as to min/max tree 
nodes. 

To prove soundness and completeness of this generalized SLD-resolution 
proof procedure, some further concepts have to be introduced. 

First, we have to take care of renaming closure of the generalized constraint 
language 1Z(£) . A constraint language is said to be closed under renaming iff 
every constraint has a p-variant for every renaming p. Clearly, 1Z(£) is closed 
under renaming if the underlying constraint language £ is closed under renaming. 
Furthermore, for each 1Z(£) closed under renaming, for each 1Z(£) -interpretation 
A : A is a model of an TZ(£) -constraint iff A is a model of each of its variants. 

Next, we have to redefine a complexity measure for goal reduction for 
the quantitative case. This measure is crucial in proving termination of goal 
reduction and works by keying steps of the minimal model construction to steps 
of the goal reduction process. 

— The complexity of a variable assignment a for an atom r(x) in the mini- 
mal model .4 s.t. fi(a(x):r A ) > is defined as comp(a,r(x),A) := min{i\ 
fi(a(x):r A ) = fi(a(x): r A ' )}. 

— The complexity of a for goal G = T"i(xi) & ... & rk(xk) & <j> in ^4 s.t. 
a e l$] A and /x(a(xj); ri A ) > for all i : 1 < i < k is defined as 
comp(a,G, A ) := {comp(a, rj(xj), A )| 1 < i < k} where {. . .} is a mul- 
tiset. 

— The V-complexity of a for goal G = ri(xi) & ... & rfc(xfc) & in A s.t. 
a <= \4>\v ano - M a ( x »); r i A ) > for all z : 1 < i < k is defined as 
campy (a, G, A ) :— min{comp((3,G 1 A)\ (5 £ 14>} A , ^((3(^)',n A ) > for 
all i : 1 < i < k and a — (3\v} where the minimum is taken with respect to 
a total ordering on multisets s.t. M < M' iff V.t e M \ M', 3x' e M' \ M 
s.t. x < x' . 

Clearly, the constraint solving part of the deduction scheme does not affect the 
denotation or complexity of subsequent goals. 

The following proofs show that the quantitative proof procedure is sound and 
complete with respect to the above stated semantic concepts. Again, there is a 



close similarity to the corresponding statements for the non-quantitative case of 
[11]. 



Theorem 13 soundness. For each quantitative definite clause specification Pf , 
for each goal G, for each C -constraint ip: If there is a proof tree for G from 
Pf with answer constraint ip and root value v, then ip „ — ► G is a logical conse- 
quence ofVF ■ 

Proof. The result is proved by induction on the depth d of the proof tree, where 
one unit of depth is from max-node to max-node. 

Base: We know that proof trees of depth d = have to take the form of a single 
max-node labeled by a satisfiable C -constraint ip with root value 1 . Then ip 
i — > ip is a logical consequence of Pf ■ 

Hypothesis: Suppose the result holds for proof trees of depth d < n. 

Step: Let Go = r(x) & (p be a goal labeling a proof tree of depth d = n with 
answer constraint ip and root value h, 

let G' = <?i(xi) & ... & <Zfc(xfc) & & be a goal labeling the min-nodc 
obtained from Go via — ^ using the variant C' = r(x) <—f(p'& <Zi(xi) & . . . 
& 9fe(xfc) of a clause G in Pp , 

and let Gi = <?i(xi) & 0", . . . ,Gk — <Zfc(x/c) & 4>" be goals labeling max- 
nodes obtained from G' via — % . 

Then each goal Gi , . . . , G& labels a proof tree of depth d < n with re- 
spective answer constraint ipi,...,ipk and root value gi,...,gk s.t. h = 
f x min{gi, . . . , g^} and for each model A of Pf ■ lip} A = [V'l & • ■ ■ & ipk] A , 
by definition min/max tree 

=>■ V'l gi ~ * G\, . . . ,ipk g k — > Gfe are logical consequences of , by the 
hypothesis 

=S> for each model .4 of P F , for each a € ASS: [V?]" 4 C \4>"\ A and if 
a e M" 4 , then /i(a(xi); 91 - 4 ) > 51, . . . , /u(a(x fc ); i^- 4 ) > # fe , by definition 
of logical consequence 

=>■ for each model Aof P F , for each a e ASS : {ipj A C J^']- 4 and if a e 
J^]- 4 , then ^(c^x);/" 4 ) > / x mm{ / u(a(xi); fli" 4 ), . . . , ^(a(x fe ); qfe" 4 )}, 
since each model *4 of Pf is a model of G' iff A is a model of G 

==>■ for each model Aof Pf , for each a £ ASS : [V']" 4 C [</>]' 4 and if a 6 
[V?]- 4 , then fi(a(x);r A ) >h 

=> ip h —* r ( x ) & ^ is a logical consequence of Pf ■ 
The result follows by arithmetic induction. □ 

Theorem 14 completeness. Let Pp be a quantitative definite clause specifica- 
tion in P{C) , C be closed under renaming, Abe a minimal model of Pf , G be a 
goal of the form r(x) & <p, a G \<P\ A and /x(/3(x); r A ) = v s.t. v > and a = j3\v ■ 
Then there exists a proof tree for G from Pf with answer constraint ip and root 
value v and a E \<p>\ A - 



Proof. The result is proved by induction on c = compv(a, G,A). 



Base: We know that goals with complexity c = have to take the form of a satis- 
fiable C -constraint X- Then there exists a proof tree for \ from Vf consisting 
of a single max-node labeled with \ an d root value 1. 

Hypothesis: Suppose the result holds for goals with complexity c < N. 

Step: Let G = g(x) & ip, a' G [V>]$, a" G M" 4 , a' = a"\v, comp v (a' ,G ,A) 
= comp(a" ,Go,A) — N, comp(a" ,q(x),A) := i, ^i(a"(x);q A ) — h and 
h>0. 

First we observe, that /j(a"(x);q Ai ) = h, since comp(a" , q(x), A ) := i 

=>■ there exists a variant g(x) *—fip' & 91 (xi) & ... & qk(^-k) s.t. 
h = f x min{^(a(xi);qi- 4 '- 1 ), . . . , /x(a(x fe ); fe- 4 '- 1 )} 
and a" G [V'']- 4 *- 1 and (VUV(^))nV(^' & 9i(xi) & ... & « fc (x fc )) C 
V(q(x)), by definition 7 and renaming closure of 7£(£) , finite V and 
infinitely many variables in VAR 

=^ G ^> G s.t. G' = gi(xi) & ... & %(x fc ) & V" 

and = [V' & V^ly, by definition of the inference rules. 

Next, a' e Wlv, since «" e M" 4 , a" G ty']-*" 1 C [V-']" 4 , 
a" G [V & V>'K, [0 & = [0"]^ and a' = a"\ v . 

Finally, compv(ct', G , A ) < N, since compy(a', Gq, >4 ) 

< comp(a", Gq, .A ) < {«} = {comp(a", q(x), A )} = comp(a" , Go, A ) = 
compv(a', Go, -4 ) = N. 

Now we can obtain goals G\ = 9i(xi) & 0", . . . , Gk — 9fc(xfc) & 0" from G 
s.t. a' G m(«"(xi); 9i • a ) = 3i > 0, . . . , MK(x fc ); ft- 4 ) = g k > 0, 

a' = a"\v and compy(a', Gi, ^4. ) < N,. . . , compv (a', Gfc, .A ) < N. 

=>■ for each goal Gi,...,Gfe, there exists a proof tree from Vf with 
respective answer constraint Xi,---,Xfc and respective root value 
ffi = 3i, • ■ • ,9k = 9k and a' G [xi & • • ■ & Xfc]y = [x]y, by the 
hypothesis 

=>■ there exists a proof tree for Go from Vf with answer constraint x and root 
value ti = f x min{g' x , ...,g' k } = fx min{gi, ...,g k } = h and a' G [x]y- 

The result follows by arithmetic induction. □ 

Returning to our toy example, the proof procedure for quantitative definite 
clause specifications can be illustrated as follows. 

Example 2. Starting from the simple program of Example 1, a min/max tree for 
query p(X) & X — (f> and Vf is constructed as follows. 



i,x = <t>kx = 4> 

.7 x min{l} 

I c 
X = 6 



p(X) kX = <t> 
max{.7, .5, 0} 

2, X = 4>kX = 
.5 x min{l} 

c 

x = 4> 
l 




3,X = iP&X = 4> 
.9 x min{0} 

I c 
_L 





This tree contains two success branches and one failure branch (from left to 
right). The proof trees obtained from this min/max tree are as follows. 

p(X)kX = cf> 
max{.7} 

r 

i, x = <j)kx = 4> 

.7 x min{l} 



X 



p(X)&X = (f> 
max{.5} 

r 

2, X = 4>kX = (j) 
.5 x min{l} 

c 

X = <f) 
1 

Clearly, X = <f> .7— > & X = is a logical consequence of Vf ■ 

As proposed by [26], search strategies such as alpha-beta pruning (see [22]) 
can be used quite directly to define an interpreter for quantitative rule sets. The 
same techniques can be applied to a min/max proof procedure in quantitative 
CLP. In general, the amount of search needed to find the best proof for a goal, 
i.e., the maximal valued proof tree for a goal from a program, will be reduced 
remarkably by controlling the search by the alpha-beta algorithm. 



4 Quantitative CLP and Weighted CLGs 



To sum up, the quantitative CLP scheme presented above allows for a definition 
of the parsing problem (and similarly of the generation problem) for weighted 



CLGs in the following way: Given a program Vf (coding some weighted CLG) 
and a query G (coding some input string), we ask if we can infer a Vf -answer 
<p of G (coding an analysis) at a value v (coding the weight of the analysis) 
proving ip v — > G to be a logical consequence of Vf ■ The concept of weighted 
logical consequence thus can be seen as a model-theoretic counterpart to the 
operational concept of weighted inference. 

We showed soundness and completeness results for a general proof procedure 
for quantitative constraint logic programs with respect to a simple declarative 
semantics based on concepts of fuzzy set algebra. These terms in turn allow for 
a deeper characterization of the concept of weighted logical consequence: A Vf - 
answer to a query G = r(x) & cj> at value v is a satisfiable £ -constraint ip such 
that for each model A of Vf holds: If ip is satisfiable, then </> is satisfiable and 
all objects assigned to x by a solution of <p are in the denotation of r(x) at a 
membership value of at least v. 

Considering concrete instantiations and applications of this formal scheme, 
the remaining question is how to give the concept of weight an intuitive inter- 
pretation. In the following we will briefly discuss two possible interpretations of 
weighted CLGs each of which is determined by the specific aims of a specific 
application. 

One interpretation of weights is as a correlate to the degree of grammaticality 
of an analysis. In [8, 9], Erbach attempts to calculate the degree of grammat- 
icality of an analysis from the application probabilities of clauses used in the 
analysis and additional user-defined weights. 4 Regardless of the motivation for 
this specific determination of degrees of grammaticality, the choice to interpret 
weights in correspondence to degrees of grammaticality severely restricts the 
possible applications of such weighted CLGs. 

Considering for example the problem of ambiguity resolution which is also 
addressed by Erbach, we think that the concepts of preference value and degree 
of grammaticality should be clearly differentiated. As discussed in [1], the prob- 
lem of ambiguity resolution cannot be reduced to some few unrealistic examples. 
Instead, when describing a nontrivial part of natural language, grammars of the 
usual sort will produce massive artificial ambiguity where we can find grammati- 
cal readings even for the most abstruse analyses. Suppose for example a grammar 
which licenses, among many others, analyses such as 1) John believes [Peter saw 
Mary]s and 2) John believes [New York^ taxijy drivers^NP- Such a gram- 
mar would also license the analysis 3) John believes [Peters sawN Mary^NP 
(provided a noun entry for the noun reading of saw), which is clearly less pre- 
ferred than 1). Analysis 3) otherwise is not less grammatical, as we can find 
an acceptable reading (where the NP refers to the Mary associated with some 
kind of saw called a Peter saw). Degrading the weight of the rule NP — > N 

4 Erbach sketches a calculation scheme which employs a restricted summation over 
clause weights instead of a minimization as is done in our quantitative CLP scheme. 
This calculation scheme could easily be captured by our quantitative CLP scheme 
by replacing min by a restricted sum in the relevant definitions of the declarative 
semantics and accordingly of the procedural semantics of our scheme. 



TV AT (licensing multiple nominal modifications) would on the other hand also 
degrade the weight of 2), which prevents a disambiguation by an interpretation 
of weights in terms of degrees of grammaticality. 

Considering the problem of graded grammaticality, it seems necessary to 
employ richer models for a determination of degrees of grammaticality. A first 
attempt to incorporate degrees of grammaticality investigated by psycholinguis- 
ts experiments into CLGs is presented in [13, 14]. 5 Weighted CLGs interpreted 
in a serious framework of graded grammaticality then could provide a valuable 
framework for a clear procedural and declarative treatment of graded grammat- 
icality in CLGs. 

Another interpretation of weighted CLGs is possible from the viewpoint of 
probabilistic grammars. This approach has been shown to be fruitful, e.g., for the 
problem of ambiguity resolution. The simple but useful approximation adopted 
here is to assume the most plausible analysis of a string to be the most probable 
analysis of that string. 

An attempt to transfer the techniques of probabilistic context-free grammars 
(see [3]) to CLGs was presented in [7]. In this approach the derivation process of 
CLGs is defined as a stochastic process by the following stochastic model: Each 
program clause gets assigned an application probability and the probabilities of 
all clauses defining one predicate have to sum to 1. The probability of a proof 
tree is calculated as the product of the probabilities of the rules used in it. 6 In 
order to make the probabilities of proof trees as defined by the stochastic model 
constitute a proper probability distribution, an additional normalization with 
respect to the overall probability of proof trees has to be made. 7 

What is interesting about probabilistic language models is their ability to 
estimate the probabilistic parameters of the model in accord to empirical prob- 
ability distributions. Eisele attempts to estimate the probability of clauses pro- 
portional to the expected frequency of clauses in derivations. Unfortunately, 
this approach to parameter estimation is incorrect when applied to the proba- 
bilistic CLG model of Eisele. This means that the probability distribution over 
proof trees as defined by a probabilistic CLG model estimated by the expected 
clause frequency method is not in accord with the frequency of the proof trees 
in the training corpus. Similarly, when dealing with unparsed corpora, the EM- 
algorithm used for parameter estimation optimizes the wrong function when 
applied to this model. The reason for this incorrectness is that the set of trees 
generated from such a probabilistic CLG model is constrained in a way which 
violates basic assumptions made in the applied parameter estimation method. 
In other words, the probabilistic CLG model defined by Eisele could be said to 

5 Keller concentrates on experimental investigation of degrees of grammaticality and 
sketches a model of graded grammaticality based on ranked constraints. Such a model 
should easily be given a formal basis in terms of our quantitative CLP scheme. 

6 This calculation scheme also could easily be captured by our quantitative CLP 
scheme by replacing min by a product accordingly in the relevant definitions of 
the declarative and procedural semantics of our scheme. 

7 [3] discuss further conditions on consistency of probabilistic grammars which would 
have to be satisfied also by a probabilistic CLG model. 



be incorrect, in the sense that it makes an independency assumption for clause 
applications which is violated by the languages generated from such probabilistic 
CLGs. 

Since the proposed parameter estimation method is only provably correct for 
the context-free case, the probabilistic language model of Eisele faces a severe 
restriction. The only approach we know of to present a correct parameter esti- 
mation algorithm for probabilistic grammars involving context-dependencies is 
the model of stochastic attribute- value grammars of [2] , a discussion of which is 
beyond our present scope. 

5 Conclusion 

We presented a simple and general scheme for quantitative CLP. Our quanti- 
tative extension straightforwardly transferred the nice properties of the CLP 
scheme of [11] into close analogs holding for a quantitative version of CLP. With 
respect to related approaches to quantitative extensions of conventional logic 
programming, our extension raises ideas from this area to the general frame- 
work of CLP. 

We showed soundness and completeness results with respect to a declarative 
semantics based on concepts of fuzzy set algebra. This approach to a declarative 
semantics was motivated by the aim to give a clear and simple formal semantics 
for weighted CLGs. 

Clearly, more expressive quantitative extensions of CLP are possible and will 
be addressed in future work. Regarding the interest in computational linguistics 
problems such as ambiguity resolution, however, a necessary prerequisite for a 
more sophisticated semantics for probabilistically interpreted quantitative CLP 
is the development of a probabilistic model for CLP which allows for correct 
parameter estimation from empirical data. 
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